Regularized Mapping to Latent Structures and Its Application to Web Search
نویسندگان
چکیده
Projection to Latent Structures (PLS), also known as Partial Least Squares, is a method for matching objects from two heterogeneous domains. Although PLS is empirically verified effective for matching queries and documents, its scalability becomes a major hurdle for its application in real-world web search. In this paper, we study a general framework for matching heterogeneous objects, which renders a rich family of matching models when different regularization are enforced, with PLS as a special case. Particularly, with l1 and l2 type of regularization on the mapping functions, we obtain the model called Regularized Mapping to Latent Structures (RMLS). RMLS enjoys many advantages over PLS, including lower time complexity and easy parallelization. As another contribution, we give a generalization analysis of this matching framework, and apply it to both PLS and RMLS. In experiments, we compare the effectiveness and efficiency of RMLS and PLS on large scale web search problems. The results show that RMLS can achieve equally good performance as PLS for relevance ranking, while significantly speeding up the learning process.
منابع مشابه
Hierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics
This paper discusses about the future of the World Wide Web development, called Semantic Web. Undoubtedly, Web service is one of the most important services on the Internet, which has had the greatest impact on the generalization of the Internet in human societies. Internet penetration has been an effective factor in growth of the volume of information on the Web. The massive growth of informat...
متن کاملQuery expansion based on relevance feedback and latent semantic analysis
Web search engines are one of the most popular tools on the Internet which are widely-used by expert and novice users. Constructing an adequate query which represents the best specification of users’ information need to the search engine is an important concern of web users. Query expansion is a way to reduce this concern and increase user satisfaction. In this paper, a new method of query expa...
متن کاملPARTICLE SWARM-GROUP SEARCH ALGORITHM AND ITS APPLICATION TO SPATIAL STRUCTURAL DESIGN WITH DISCRETE VARIABLES
Based on introducing two optimization algorithms, group search optimization (GSO) algorithm and particle swarm optimization (PSO) algorithm, a new hybrid optimization algorithm which named particle swarm-group search optimization (PS-GSO) algorithm is presented and its application to optimal structural design is analyzed. The PS-GSO is used to investigate the spatial truss structures with discr...
متن کاملLearning bilinear model for matching queries and documents
The task of matching data from two heterogeneous domains naturally arises in various areas such as web search, collaborative filtering, and drug design. In web search, existing work has designed relevance models to match queries and documents by exploiting either user clicks or content of queries and documents. To the best of our knowledge, however, there has been little work on principled appr...
متن کاملMappings to Realcompactifications
In this paper, we introduce and study a mapping from the collection of all intermediate rings of $C(X)$ to the collection of all realcompactifications of $X$ contained in $beta X$. By establishing the relations between this mapping and its converse, we give a different approach to the main statements of De et. al. Using these, we provide different answers to the four basic questions...
متن کامل